16:11
2026-06-25
huggingface.co
large-language-models
Which tokens does a hybrid model predict better?
Researchers at the Allen Institute for AI compared their 7B transformer model Olmo 3 with the hybrid model Olmo Hybrid to determine which tokens each predicts better. The hybrid model excels on meaninβ¦